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Quantum coherence and quantum entanglement represent two fundamental features of non-classical systems 
that can each be characterized within an operational resource theory. In this paper, we unify the resource theories 
of entanglement and coherence by studying their combined behavior in the operational setting of local incoherent 
operations and classical communication (LIOCC). Specifically we analyze the coherence and entanglement 
trade-offs in the tasks of state formation and resource distillation. For pure states we identify the minimum 
coherence-entanglement resources needed to generate a given state, and we introduce a new LIOCC monotone 
that completely characterizes a state’s optimal rate of bipartite coherence distillation. This result allows us 
to precisely quantify the difference in operational powers between global incoherent operations, LIOCC, and 
local incoherent operations without classical communication. Finally, a bipartite mixed state is shown to have 
distillable entanglement if and only if entanglement can be distilled by LIOCC, and we strengthen the well- 
known Horodecki criterion for distillability. 


The ability for quantum systems to exist in “superposition 
states” reveals the wave-like nature of matter and represents 
a strong departure from classical physics. Systems in such 
superposition states are often said to possess quantum coher¬ 
ence. There has currently been much interest in constructing 
a resource theory of quantum coherence [1-11], in part be¬ 
cause of recent experimental and numerical findings that sug¬ 
gest quantum coherence alone can enhance or impact physical 
dynamics in biology [12-15], transport theory [2, 16, 17], and 
thermodynamics [18, 19]. 

In a standard resource-theoretic treatment of quantum co¬ 
herence, the free (or “incoherent”) states are those that are di¬ 
agonal in some fixed reference (or “incoherent”) basis Differ¬ 
ent classes of allowed (or “incoherent”) operations have been 
proposed in the literature [1, 3, 9-11] (see also [20, 21] for 
comparative studies of these approaches), however an essen¬ 
tial requirement is that the incoherent operations act invari- 
antly on the set of diagonal density matrices. Incoherent op¬ 
erations can then be seen as one of the most basic generaliza¬ 
tions of classical operations (i.e. stochastic maps) since their 
action on diagonal states can always be simulated by classical 
processing. Note also that most experimental setups will have 
a natural basis to work in, and arbitrary unitary time evolu¬ 
tions might be physically difficult to implement. In these set¬ 
tings, there are practical advantages to identifying “diagonal 
preserving” operations as being “free” relative to coherent- 
generating ones. 

One does not need to look far to find an important con¬ 
nection between incoherent operations and quantum entangle¬ 
ment, the latter being one of the most important resources in 
quantum information processing [22]. Consider the task of 
entanglement generation. This procedure is usually modeled 
by bringing together two or more quantum systems initially 
in a product state p ® cr and then applying an entangling joint 
operation. However, using only incoherent operations, this 
will not be possible unless either p or cr already possesses 
coherence. The reason is that when p ® cr is an incoherent 


bipartite state, any incoherent operation acting on both sys¬ 
tems will leave the joint state incoherent (and hence unen¬ 
tangled). On the other hand, if the joint state is |h-) |0), with 
1+) = Vl/2(|0) + |1)), then an application of CNOT yields the 
entangled state Vl/2(|00) + |11)). This example reveals that 
coherence, or at least coherent-generating operations, is a pre¬ 
requisite for producing entanglement. In fact, as Streltsov et 
al. have shown [23], every coherent state can be used for the 
generation of entanglement in a manner similar to this exam¬ 
ple. 

Notice that the transformation |-i-) |0) —> Vr/^dOO) + |11)) 
requires performing an entanglement-generating incoherent 
operation. To capture both coherence and entanglement in a 
common resource-theoretic framework, one must modify the 
scenario by adopting the “distant lab” perspective in which 
two or more parties share a quantum system but they are spa¬ 
tially separated from one another [22, 24]. In this setting, 
entanglement cannot be generated between the parties and it 
becomes another resource in play. When the constraint of 
locality is added to the incoherent framework, the allowable 
operations for Alice and Bob are then local incoherent op¬ 
erations and classical communication (LIOCC). The hybrid 
coherence-entanglement theory described here is similar in 
spirit to previous work on the locality-restricted resource the¬ 
ories of purity [25-28] and asymmetry [29]. We admittedly 
do not point to a specific biological or thermodynamic pro¬ 
cess as motivation for studying LIOCC - although, one could 
envision potential physical applications in certain coherence- 
enhanced transport networks where the nodes interact through 
classical signaling. Rather, we promote LIOCC as the natural 
setting to explore the interplay between coherence and entan¬ 
glement as resource primitives in quantum information theory. 
Lor example, how much local coherence and shared entan¬ 
glement do Alice (A) and Bob (B) need to prepare a particu¬ 
lar bipartite state using LIOCC (Lig. 1 (a))? Conversely, 
how much coherence and entanglement can be distilled from a 
given state using LIOCC (Lig. 1 (b))? The latter task can 
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Figure 1. (a) An LIOCC formation protocol asymptotically gen¬ 

erates an arbitrary state from an initial supply of local coherent 
bits (Oa/®b) and shared entanglement bits (Oa'b')- (b) An LIOCC 
dilution protocol performs the reverse transformation. 

also be seen as type of collaboative randomness distillation, 
where Alice and Bob work together to generate local sources 
of genuine randomness for each other [6]. 

Our main results are the following. (1) We completely 
characterize the achievable coherence-entanglement rate re¬ 
gion for the task of asymptotically generating some pure state 
(xjieorem 1). (2) We introduce a new LIOCC mono¬ 
tone that combines both coherence and entanglement mea¬ 
sures (Theorem 4), and we show it quantifies the optimal rate 
in which Alice and Bob can simultaneously distill local co¬ 
herence from a pure state. (3) We identify an achievable rate 
region for the coherence-entanglement distillation of a pure 
state and show optimality at almost all comer points (Theo¬ 
rem 5). (4) In analogy to Refs. [25-28], we introduce and 
compute for pure states the nonlocal coherence deficit and the 
LIOCC coherence deficit (Eqns. (8)-(9)). (5) We show that 
LIOCC operations alone are sufficient to decide whether en¬ 
tanglement can be distilled from a mixed state using general 
LOCC. 

Let us begin by briefly describing the theory of bipartite 
coherence in more detail. Assigned to both Alice and Bob’s 
system is a particular basis called their incoherent basis. We 
denote Alice’s incoherent basis by and Bob’s inco¬ 
herent basis by so that the incoherent basis for their 

joint system ®'K® is {|.r)'^ *■ Then any bipar¬ 

tite state belongs to the set of incoherent states I iff it has the 
form 

^ Pxy\x){x\^ ® lyXyl^- (1) 

xy 

Following the framework of Baumgratz et al. [3], a local inco¬ 
herent operation for Alice is given by a complete set of Kraus 
operators {Ka]a such that {Ka ® ® 1^)"^ltr{KaKa ® 

jBpAB-^ e I for all p^^ e I. If ever she introduces a local 
ancilla system , the incoherent basis for this additional 
system is labeled in the same way {|x)'^ )^=o *■ Analogous 
statements characterize the notion of incoherent operations on 


Bob’s system. In the LIOCC setting, Alice and Bob take turns 
performing local incoherent operations and sharing their mea¬ 
surement data over a classical communication channel. 

The canonical resource states in the bipartite LIOCC frame¬ 
work are the maximally coherent bits (CoBits), |<E)a) ;= 
VT72(|0)'^ -I- 11)'^) and IOb) ;= VT72(|0>^ -H |1)*) for Alice 
and Bob’s systems respectively [3], as well as the entangled 
state 1*5x5) Vl/2(|00) + |11)), which we will call the max¬ 

imally coherent entangled bit (eCoBit). Notice that unlike en¬ 
tanglement theory, only those bipartite states related to |< 1 )a 5 ) 
by an incoherent local unitary transformation can be regarded 
as equivalent to |<1 >xb)- Fftr example, as we will see below, 
one eCoBit cannot be incoherently transformed into the state 
Vl/2(|0-i-) + |1-)), even asymptotically. 

We now describe the primary tasks studied in this paper, 
which can be seen as the resource-theoretic tasks recently an¬ 
alyzed by Winter and Yang in Ref. [7] but now with additional 
locality constraints. All of the detailed proofs can be found in 
the Supplemental Material, and here we just present the re¬ 
sults. Let us begin with the problem of asymptotic state for¬ 
mation shown in Fig. 1 (a). A triple {Ra,Rb, E™) is an achiev¬ 
able coherence-entanglement formation triple for the state p^^ 
if for every e > 0 there exists an LIOCC operation £. and in¬ 
teger n such that 

_£^(j)®rn(KA+e)l igi ^ ^®rn(£“+e)l^ ^ p»n 

Dual to the task of formation is resource distillation, as de¬ 
picted in Fig. 1 (b). A triple {Ra,Rb,E‘^‘’) is an achievable 
coherence-entanglement distillation triple for p^^ if for every 
e > 0 there exists an LIOCC operation £. and integer n such 
that 

XOo®") ^ (1)®^"'*^^'^’^ ® (l)®L«(«B-f)J g, 

As we are dealing with asymptotic transformations, we should 
expect the optimal rate triples to be given by entropic quan¬ 
tities. Recall that for a bipartite state the von Neu¬ 
mann entropy of, say, Alice’s reduced state co^ is given by 
S{A)a, - -tr[w'^ log w'^]. The quantum mutual information 
of takes the form I(A : B)^ := 5(A)(j - 5(A|B)(^, where 
S (A|B)(j := S {AB)^ - S (B)w For a pure state the en¬ 
tropy of entanglement E('l') 5(A)4< = S(B)^ is the unique 

measure of entanglement in the asymptotic regime [30], and 
it can be generalized to mixed states as the entanglement of 
formation Ef(p) [31]. We will also be interested in these en¬ 
tropic quantities after sending our state through the com¬ 
pletely dephasing channel, A(fa;) := Zijy |xy){xy|w|xy){xy|. It 
will be convenient to think of A(w) as encoding random vari¬ 
ables XY having joint distribution p(x,y) = (xy| A(w) |xy). For 
this reason, we follow standard convention and replace the la¬ 
bels (A, B) {X, Y) when discussing a dephased state. 

Our first main result completely characterizes the achiev¬ 
able rate region for the LIOCC formation of bipartite pure 
states. 



















3 


Theorem 1. For a pure state the following triples are 

achievable coherence-entanglement formation rates 

(RA,RB,En = ( 0, , 5(X)a(>p)) (2) 

{Ra,Rb,E^'’) = {S{X)^^^), 5(F|X)a(^), E('P)) (3) 

{Ra,Rb,E^'’)^{Q,0, S(XY)a(^)) (4) 

as well as the points obtained by interchanging A <r^ B in 
Eqns. (51) - (53). Moreover, these points are optimal in the 
sense that any achievable rate triple must satisfy (i) E™ > 
E(n (ii) Ra+Rb> S(XY)Am, (Hi) Rb + > 5(XF)a(Y). 

Eor a mixed state a formation protocol can be con¬ 
structed that achieves the average rates for any ensemble 
such thatp = Yjk Pk\‘Pk}{iPk\ [31]. Eor instance. One 
can consider an ensemble whose average bipartite coherence 
attains the coherence of formation Cf for p; i.e. it is an ensem¬ 
ble {pk,\(pk}'^‘^} forp that minimizes YjkPkSiXY)^^,^^) [6, 7]. 
Then for a mixed state p, the coherence rate sum Ra + Rb 
of Eq. (52) can attain the coherence of formation Cf(p). In 
the global setting where Alice and Bob are allowed to per¬ 
form joint operations across system AB, it has been shown that 
Cfip) quantifies the optimal coherence consumption rate for 
generating p using global incoherent operations [7]. Our re¬ 
sult then intuitively says that in the restricted LIOCC setting, 
the same coherence rate is sufficient to generate p, however 
they now need additional entanglement at a rate Yjk Pk^icpk), 
where the ensemble {pk, \ipk)^^] minimizes the average coher¬ 
ence of p. 

The proof of Theorem 1 uses two lemmas that may be of 
independent interest. The first generalizes a result presented in 
Ref [3], and the second is an incoherent version of Nielsen’s 
Majorization Theorem [32]. 

Lemma 2. An arbitrary dxd unitary operator U can be per¬ 
formed on a system using incoherent operations and ^log^f] 
CoBits. 

Lemma 3. Suppose andhave reduced density ma¬ 
trices that are diagonal in the incoherent bases for both par¬ 
ties and both states. Then \i//} \<p) by LIOCC iff the squared 

Schmidt coefficients of\(p) majorize those of If). 

Next, we introduce a new LIOCC monotone and provide 
its operational interpretation. To do so, we recall the re¬ 
cently studied task of assisted coherence distillation, which 
involves one party helping another distill as much coherence 
as possible through general quantum operations performed 
on the helper side and incoherent operations performed on 
the distillation side [33]. Eor a given state p'^^, the opti¬ 
mal asymptotic rate of coherence distillation on Bob’s side 
when Alice helps is denoted by Ca'^(p'^^). When the roles are 
switched, the optimal asymptotic rate is denoted by Cf^(p‘^^). 
It was shown in Ref. [33] that C^'^lp'^*) = 5 (T)a('i') and 
Cf'’^(p'^*) = 5(2f)A(>p)- With these quantities in hand, we de¬ 
fine for a bipartite pure state the function 

Cx('I') C;JI^('I') -H Cf'^(T') - E('I') 

= 5(X)Am + 5(T)A(>r)-E('I'). (5) 


Its extension to mixed states can be defined by a convex roof 
optimization [34]: C_c(p^^) = inf PkC_c(ip'^^) for 

which p'^^ = Y,k PkMiVkl 

Theorem 4. The function C^ is an LIOCC monotone. 

We note that this is the first monotone of its kind since it be¬ 
haves monotonically under LIOCC, but not general LOCC or 
even under LQICC, the latter being an operational class in 
which only one of the parties is required to perform incoher¬ 
ent operations (as opposed to LIOCC where both parties must 
perform incoherent operations) [33]. 

Using the monotonicity of C^, we are able to derive tight 
upper bounds on coherence distillation rates. 

Theorem 5. For a pure state I'P)'’'* the following triples are 
achievable coherence-entanglement distillation rates 

(Ra,R£,E“) = (5(X)Am - E('I'), 5(T)Am, 0) (6) 

(RA,RB,En = (0, 5(T|X)a(4<), I(X : T)A(>r)), (7) 

as well as the points obtained by interchanging A ^ B in 
Eqn. (65) and (66). Moreover, these points are optimal in the 
sense that any achievable rate triple must satisfy (i) Ra+Rb ^ 
Cx('I') and (ii) Rb -h £“ < 5 (T)Acr). 

This theorem endows Cx with the operational meaning of 
quantifying how much local coherence can be simultaneously 
distilled from a pure state. Eor a state I'T) the maximum that 
Alice can help Bob distill coherence is while the maxi¬ 
mum that Bob can help Alice is Cf'^. Evidently, they cannot 
both simultaneously help each other at these optimal rates. In¬ 
stead, they are bounded away from simultaneous optimality at 
a rate equaling their shared entanglement. 

It is still unknown the precise range of achievable distilla¬ 
tion triples (Ra,Rb,E™,jj), where Efj^^ is the maximum eCo- 
Bit distillation rate. While we are able to prove that Ef)^^ is the 
regularized version of liX : T)acp) optimized over all LIOCC 
protocols, we have no single-letter expression for this rate nor 
do we know the achievable local coherence rates for optimal 
protocols. 

A natural question is whether - E)*!*). While this 

question remains open, we can show that E)*!*) is achievable 
if the Schmidt basis of the final state need not be incoherent. 
More precisely, we say a number R is an achievable LIOCC 
entanglement distillation rate if for every e > 0, there exists an 
LIOCC protocol X acting on n copies of such that X('I'®") ^ 
A(/, where A^ is a t/ ® c/ maximally entangled pure state (i.e. 

-Ijd) with i logc/ > R - e. The largest achievable 
distillation rate will be denoted by £^^‘^‘'‘'('1'). 

Theorem 6. = E(T'). 

It is interesting to compare the coherence distillation rates 
using incoherent operations under different types of local¬ 
ity constraints. In Refs. [25-28], similar comparisons were 
made in terms of purity (or work-information) extraction. Let 
CGtobai^ qUocc^ (jUO (jgnote the optimal rate sum Ra+Rb 
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of local coherence distillation using global incoherent opera¬ 
tions, LIOCC, and local incoherent operations (with no clas¬ 
sical communication), respectively. In complete analogy to 
[25-28], we define the nonlocal coherence deficit of a bipar¬ 
tite state as 5{p^^) - and 

the LIOCC coherence deficit as - 

C^^ipfi^). Intuitively, the quantity 6{p^^) quantifies the co¬ 
herence in a state that can only be accessed using nonlocal in¬ 
coherent operations. Likewise, 6c(p‘^^) gives the coherence in 
p'^^ that requires classical communication to be obtained. The 
results of Winter and Yang imply that = S (YF)A(>i') 

and = S (Y)a(>p)-i- 5 (F)A(>i')-2E('I') for a bipartite pure 

state [35]. Combined with Theorem 5, we can compute 
the two coherence deficits for pure states: 

d)*!-) = E('P) - I(X : T)a(T) (8) 

5,('T) = E{'¥). (9) 

It is curious that the entanglement E('T) quantifies the coher¬ 
ence gain unlocked by classical communication. But note that 
a similar phenomenon exists in the resource theory of purity. 
Namely, the quantum deficit 5(T') and classical deficit 5e('P) 
measure the analogous differences in local purity distillation 
by so-called “closed operations” (CO), and they are given by 
d('T) = 4('T) = E('T) [25, 26]. Eor the task of distilling Co- 
Bits, every protocol using incoherent operations can be seen 
as one using closed operations by accounting for all ancilla 
systems at the start of protocol [36]. However, closed opera¬ 
tions allow for arbitrary unitary rotations, which are forbidden 
in coherence theory. The term I{X : T)A('r) in (5(T') identifies 
precisely the basis dependence in coherence theory and shows 
how this decreases the nonlocal coherence deficit (5('P) relative 
to 5(T'). On the other hand, there is evidently no basis depen¬ 
dency in the LIOCC classical deficit 5c('P) and it is equivalent 
to d,('P). 

Although our distillation results so far have only applied to 
pure states, we can deduce a very general result concerning 
the distillability of mixed states. 

Theorem 7. A mixed state p^^ has (LOCC) distillable en¬ 
tanglement if and only if entanglement can be distilled using 
LIOCC. 

The proof of this theorem is actually quite simple and uses 
the fact that an arbitrary quantum operation can be simu¬ 
lated using incoherent operations and CoBits (Lemma 2). In 
Ref [33] it was shown how local coherence can always be 
distilled for both Alice and Bob from multiple copies of every 
entangled states using LIOCC. Hence for a sufficiently large 
number of any distillable entangled state p^^, Alice and Bob 
first distill sufficient local coherence using LIOCC, and then 
they simulate the LOCC protocol which distills entanglement. 

As shown in Ref. [37], a state p has distillable entanglement 
iff for some k there exists rank two operators A and B such that 
the (unnormalized) state A ® Bp'^^A ® Z? is entangled. By The¬ 
orem 5 and following the same argumentation of Ref. [37], 
we can further require that the A and B are incoherent oper¬ 
ators; that is, they have the form A = |0)(aol + |1 )(Q'iI and 


B = |0><ySol + |l></Sil where A(ao) := A(|ao)<aol) is orthogonal 
to A(ai) A(|ai)(ai|), and likewise for A(fio) A([6o)</6ol) 
for A(jSi) := A(|/li)(/?i|). We are thus able to add an additional 
condition to the distinguishability criterion of Ref. [37]. 

Corollary 8. A bipartite state p has distillable entanglement 
iff for any pair of orthonormal local bases IB a - (|v)'^] and 
Bb — lly)^) there exists some k and projectors Pa - |ao)(Q'ol + 
|ai)(ai| and Pb - ^o)(/So| -i- |/?i)(/Sil such that 

1. (Pa ® Pb)p’^'‘(Pa ® Pb) A entangled, 

2 . tr[AA(Q'o)AA(Q'i)] = tr[AB(^o)AB(y?i)] = 0, 
where Az is the completely dephasing map in the basis 

Conclusion: In this letter, we have investigated the relation¬ 
ship between entanglement and coherence in the framework of 
local incoherent operations and classical communication. The 
findings of this study suggest that indeed entanglement and 
coherence are closely linked resources. Eor instance. Theorem 
5 shows that the entanglement of a state plays a crucial role in 
limiting the amount of coherence that can be distilled from a 
state, a result highly reminiscent of the complementarity be¬ 
tween local and nonlocal information studied in Ref. [27]. In a 
similar spirit. Theorem 7 shows that entanglement distillabil¬ 
ity can be studied through the lens of coherence theory. This 
latter result seems somewhat remarkable since despite coher¬ 
ence being a basis-dependent resource, its resource-theoretic 
analysis can be used to draw conclusions about entangle¬ 
ment, a basis-independent resource. Euture work will be con¬ 
ducted to see whether the strengthened distillability criterion 
of Corollary 8 can be useful in the long-standing search for 
NET bound entanglement. 

Einally, we would like to comment on the particular type 
of incoherent operations studied in this letter. As noted in the 
introduction, there have been various proposals for the “free” 
class of operations in a resource theory of coherence. This let¬ 
ter has adopted the incoherent operations (lO) of Baumgratz 
et al. [3], where each Kraus operator in a measurement just 
needs to be incoherence-preserving. While the class lO has 
drawbacks in terms of formulating a full physically consistent 
resource theory of coherence [11, 20], it nevertheless seems 
unlikely that the results of this letter would remain true if other 
operational classes were considered. Eor example, the strictly 
incoherent operations (SIO) proposed by Yadin et al. are un¬ 
able to convert one eCoBit into a CoBit [11]. Thus, we believe 
that the interesting connections between lO coherence theory 
and entanglement demonstrated in this letter make a positive 
case for why lO is important in quantum information theory, 
independent of any other motivation. In fact, one could even 
put coherence aside and view LIOCC as just being a simpli¬ 
fied subset of LOCC. As we have shown here, nontrivial con¬ 
clusions about entanglement can indeed be drawn by studying 
LOCC from “the inside.” This approach is somewhat dual to 
the standard practice of studying LOCC using more general 
separable operations (SEP), the chain of inclusions being LI¬ 
OCC c LOCC c SEP. Interesting future work would be to 
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consider more general connections between coherence non¬ 
generating and entanglement non-generating operations. 

During preparation of this manuscript, we learned of work 
by Streltsov and co-authors who have also initiated a study 
into local incoherent operations and classical communication 
[38], 
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Supplemental Material 

PRELIMINARIES 
Distance Measures 

The distance measure used in this paper is based on the trace norm, which for an operator A is ||A|| triA| = tr VA' A. The 
trace distance for two states p and cr is Dtr(p, o") = j \\p - o"!!. and we will write p « cr to indicate D,r(p, cr) < e. The fidelity of 

two states - given by F(p, cr) = tr ^ -sjpcr - can be related to the trace distance by [46] 

1 - F(p, cr) < D,r(p, cr) < - Fip,cr)^. 

When cr is pure, the lower bound can be improved; 

l-E(p,|^)<^|)2<D,,(p,|^)<^|). (10) 

We reference Fannes’ inequality, which provides a bound on entropy difference in terms of the trace distance. 

Lemma 1 (Fannes-Audenaert Inequality [47, 48]). For density matrices p and cr acting on a d-dimensional space, 

15 (p) - 5(cr)| < D,Ap, 0-) log(d - 1) + h {D,r(p, cr)), (11) 

where h(x) — —xlogx — (1 — x) log(l — x). 

Finally, we will need Winter’s gentle measurement lemma. 

Lemma 2 (Gentle Measurements [49]). For p > 0 and trp < 1, suppose 0 < A < I and trpA > 1 - e. Then \\p- yfXp VA|| < V^. 


Types, Typical Sequences, Channel Coding 


In what follows, we assume that random variables A, T, ■ ■ ■ take on values x, y, ■ ■ ■ from sets A, J/, ■ ■ ■. Probability distributions 
will be denoted by p or q. For n identical and independently distributed (i.i.d.) events each with outcome distribution p, the 
distribution over the sequence of events is denoted by p". See Refs. [50, Chapter 2] and [51] for a comprehensive presentation 
of the following concepts. 

The type of a sequence x" e A" is the distribution p^, over X defined by 

Px«(a) ;= -N(a\x") Va 6 X, 
n 


where N(a\x") is the number of occurrences of the symbol a e X in the sequence x". For a given distribution p, the collection of 
all sequences having type p is called the type class of p and is denoted by T". A distribution p is said to be an empirical type 
(for some n e N) if T” is nonempty. 

A sequence x" is said to be d-typical (or just typical) w.r.t. distribution p if 


A(a|x") - p{a) 


<6 


Vae A. 


The set of all d-typical sequences will be denoted by Note that the set is the union of empirical type classes, and 
hence we will say that a distribution q is typical w.r.t. p if it is an empirical type with c 
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Three standard properties of typicality are the following. First, for n i.i.d. samples of X according to distribution p. 


p"(7'pJ:=Pr[x"6 7'(;]J>l-e (12) 

for any e,d > 0 and n sufficiently large. Second, let X be a random variable having distribution p and entropy H(X) := 
- Yjoex P(^) loS Then the size of can be related to H(X) as 


-log IT, 


[ph' 


■H{X) 


< e 


(13) 


for any e,6 > 0 and n sufficiently large. Third, if o € T" . then (for 5 < {2\X\) ') 

\.P\6 

(n + < IT’"! < 2«(«W+h-5))^ (14) 

where t{S) - -cJI/YI log 5. Note that t{S) is a function increasing in 6 for the range 0 < 5 < {2\X\) '. Eq. (14) follows from an 
application of Lemma 1 to the inequality 


(n + i)-i'^i2""® < ir;;! < (i5) 

Moving to the quantum setting, for a quantum system '77, we will assume throughout that the computational basis {|x)) is the 
incoherent basis. For an empirical type p, the corresponding type projector acting on “TY®" is given by 

np = Z 

veP” 

We restrict attention to CQ-channels ^cq ^ pf > which map each element |jc) to a density matrix p® acting on “TY. Note 

that CQ channels generalize classical channels. If we are given a set of transition probability p{y\x) characterizing a classical 
channel, then the corresponding CQ channel is \x){x\^ pj, where pj = YiyeH P(yk)l}')(yl- We will refer to channels of this 
form as CC channels ^cc- 

When a distribution p is given over X, we associate a quantum ensemble {p{x),p^]jcex with the channel 'Wcq, as well as a 
classical-quantum state p^^: 


P^^ = '^p{x)\x){x\^ ®pf. 

xeX 

The mutual information of the classical-quantum state p^^ is the so-called Holevo quantity and denoted by I(X : B) - 
S{YjxPix)Px) ~ Yjx Pix)S ip^). ForCC channels, the associated joint state is fully incoherent; 

p^^^ 2 p(x,y)\x){xf ®\y){yf, 

with mutual information I{X : Y) = H(X) + H{Y) - H{XY). 

Recall that an (n, e) code of size C for a CQ channel TVcq ^ \x){x\^ ^ p^ is a sequence of codewords with e X'' 

and a POVM acting on ”77®" such that 


^ytr^,"^<,D,]>l-e, (16) 

C=1 

where if = x", then p^,^, P% ® ■ ■ ■ ® p* e <^®n 

Coding Theorems 

We now introduce the information-theoretic machinery that provides the foundation for our coding schemes. The following 
is adopted from the work of Devetak and Winter in Ref. [52]. For an empirical type p over X, let ({/^^^^) be an i.i.d. sequence 
of random variables obtained by sampling from T" uniformly, with I - 1, • ■ • ,L and c - 1, ■ • • , C. We consider the following 
events: 
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• e-evenness: For all x” e T^, 




( 17 ) 


where Ij/a^ is the indicator function for whether = x". 

• {C/lfli (n, e) codes: For every I = 1, ■ ■ ■ ,L the codebook C/ ;= forms an (n, e) code for the channel ^cQ ^ 

\x){x\^ pf. 

Lemma 3 ([52-54]). Consider a CQ channel 'Wqq '■ pf and a random variable X with a distribution given by some 

empirical type p. Let be an i.i.d. sampling from T" with I — 1, • • • ,L,c— 1, • • • , C, and Ci = (t/^^‘^^)^^j. For every d, e > 0 

and n sufficiently large. 


-LC - - - 

Prje-evenneii) > 1 - |A’|"2 2 in 2 |r;;i ^ 

p 

C < Pr{A fraction 1 - 2e of the {C;)^[ are (n, e) channel codes) > 1 - 2^'^4in2. (18) 

Corollary 4. Consider a CQ channel "IVcQ • l■^)(2c|^ —* and let X be a distribution given by some empirical type p. For 
sufficiently large n and 6 < I(X : B), there exists a partition ofTf, such that a fraction 1 — 3e of the sequences in Tf, belong to an 
(n, e) channel code Ci, • • • , Cl, where L - |'2"(^®'2(2f:B)+2<5)-| each Ci consists ofC - [2"F(2t:B)-'5)j codewords. 

Proof Since by the choices of L and C, 

— > 2""* + ^ oo 

ir^i - 

then by Lemma 3 an i.i.d. sequence will satisfy 

Pr{e-evenness) ^ 1 

Pr{A fraction 1 - 2e of the C/ are (n, e) channel codes) ^ 1 

as n ^ CO. Thus for sufficiently large n, there must exist families of (n,e) codes c T'^ for AVcq that cover T" with a 

fraction 1 -2e of these codes being (n, e) channel codes. The union of these codes will consist of (1 -2e)LC codewords, including 
multiplicities. But by e-evenness, the number of distinct codewords in this union will be at least > (1 -3e)|7'^"|. Indeed, 

e-evenness guarantees that each individual x" has multiplicity no more than (1 -i- e)^. among all the codebooks. Hence, the 

M pi 

number of distinct codewords is at least , and so at least a fraction (1 - 3e) of the elements of T” are codewords for an 

in, e) code. □ 

With Corollary 4, we are able to almost entirely cover each type class by (n, e) channel codes having a constant rate C. We 
will also be interested in decomposing T" into “obfuscation” sets. The following covering lemma is presented in [7]. 

Lemma 5. Consider a CQ channel "TFcq • —* on a d-dimensional Hilbert space and a random variable X with a 

distribution given by some empirical type p. Let be obtained by a uniform sampling without replacement from the set 

{p"„ : x’' e T"). Define the average state 


o-(p) 


IT"I XI 

ver; 


Then for every e,6 & (0,1) and n sufficiently large. 


Pr 


i |; n. - ^wil > e| < Id- exp (-S 


(19) 


where t - 2 7(X:B)+5)_ 

We will say that a collection of states corresponding to S distinct sequences from Tf, is “good” if || - cr(p)|| < e. 
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Figure 2. Code Depiction. Our code involves first decomposing the typical set into its typical type classes where p{x) is distribution 

given by I'F^ = xfpUd Vl^x)^ for Alice’s incoherent basis Ix)*^. Each type class is further decomposed in three different ways. The 
first involves a partitioning into obfuscation sets S,„ for which Bob’s average state is roughly the same when restricting to these sets. The other 
two decompositions involve partitioning into codebooks C/ and C/ for the channels |x)<x| —> and |x)(x| —> \\l/x){4'x\ respectively. 


Corollary 6 ([7]). Let X be a random variable with distribution given by some empirical type p. For n sufficiently large, there 
exists a partitioning ofT" consisting of M subsets [S each of size S = (plus a remainder) such that a fraction 

(1 — e) of the subsets are good sets. 

Proof Consider a random partition of into M blocks each of size S. Note that each block is equivalently obtained by a 
uniform sampling of S elements from without replacement. Let be the random variable for which = 1 if the m’^ 
block is a good set and = 0 if it is not. By applying Lemma 5, n can be taken sufficiently large so that the expectation of 
Em is greater than 1 - e. Therefore for a random partition of F”, the expected number of good sets across all blocks is given by 
(2m=i ^m) - > M{\ - e). Hence, there must exist at least one partition with M(1 - e) of the blocks being good. □ 


CODE STRUCTURE 

We combine Corollaries 4 and 6 to obtain the basic structure of both our distillation and formation codes. A diagram is 
provided in Fig. 2. 

Let be an arbitrary bipartite state with 


= 2 ( 20 ) 

xeX 

where (Ix)"^) and {|y)^) denote the preferred bases with respect to which the incoherent operations are defined for Alice and Bob, 
respectively, and = ZiveJ/ sjp^\x) |y)* are normalized but not necessarily orthogonal states of Bob. Let 'Wqq and 'Wqc 
be the CQ and CC channels given by 


^CQ : IxXxl'' ^ 

^cc : kXxl'' ^ ^ J]p(y\x)\y){y\^. 


Let X be the random variable taking on values from X according to the distribution p(x) in Eq. (20). In other words, p(x) 
describes the distribution of outcomes when measuring A('T'^) in the incoherent basis. For a fixed n, the set of typical sequences 
is the union of typical types. We will denote the typical types by p,, for t - 1,2, • • • , F, and the random variable associated 
with pt will be denoted by X,. Note that T < (n + 1)I‘’‘L 

We will be interested in four different «-copy decompositions of where in all cases we assume that n is being taken 

sufficiently large. 
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Decomposition 1: The first decomposition is based on the coherence distillation protocol presented in Ref. [7]. It involves 
forming good sets Sm in the sense of Corollary 6 and w.r.t. the CQ channel ^cq- For each typical type pi, consider a partitioning 
of according to Corollary 6 . Then for each each x" € , we can relabel x" ^ {t,m, s) where p, is the typical type for 

which x” 6 Tp -, m is the block number within for which x" belongs (with m - Q labeling the small remainder block); and s 
is the order of x” in the block. For a fixed f, the range of m and s ism - 1, • • • ,M, and i = 1, • • ■ 5 where 


S — [2”^^ 1 


M, = Lir;j/5J, 


( 21 ) 


The bit rates of S, and M, satisfy 


-log5,-E('T) <0(T(d)) 


( 22 ) 


n 


1 


logM, - [ 5 (A)a( 4 .) - E('P)] < 0{t{5), ^). 

n 


(23) 


n 


The first line follows from Eannes’ Inequality and the fact that I{X : B)^ - E('I'), while the second can be seen from 2 log IT^ | « 
H{X) and 2 log 5, =» I(X : B)^. Since all but a vanishing small fraction of x” belong to , we can thus write 





(24) 


where qit) is the probability of typical type class T”^ (conditioned on the event x" e Note that for a sequence x" labeled 

by (f, m, s) we have that S (A(i/fy.)) = S — ZAex iV(x|x" 6 Tp )S with the RHS being independent of i and m. 

Since S (T|A)Acf) = p{x)S (A{if/x)), the following bound is obtained, 


i5(A(<^,,0) - 5(T|A)Am < 5 y 5(A(iAa)), 

n 


(25) 


which again follows from 5-typicality. Eor each typical type p,, we now further restrict the sum over m to only those values for 
which 5m are good sets. By Corollary 6 there are M,(l - e) such sets. Eor these values of m, we have that 



(26) 


I’l 


with the RHS being independent of m. As Uhlmann’s Theorem states that F(p\,p 2 ) - max K^i|i, 02 )l, where the maximization is 
taken over all purifications of p\ and p 2 respectively [55, 56], the previous equation implies for each pair (f, m) the existence of 
a unitary t/„„ acting on A 3 such that 



(27) 


where mo 6 {1, ■ ■ ■ , M,( 1 - e)) is some fixed number. We thus continue Eq. (24) by restricting the sum over m to just good values 
and replacing the sum over s with Eq. (27): 



(28) 


Decompositions 2 and 3: The second and third decompositions are built from (n, e) codes for the channels AVcc and 'Wqq re¬ 


spectively. The structure of the decompositions is based on the entanglement-assisted and GHZ distillation schemes of Ref. [57]. 


Eirst we turn to AVcc- For every typical type p,, consider a partitioning of according to Corollary 4. Then each x" 6 
can be relabeled x" —» (f, l,c) where p, is the typical type for which x" e T^; I is the code C; within Tj] for which x" belongs; 
and c is the order of x" in the code. Eor a fixed f, the range of / and c is 1 = 1, • • • ,L, and c - 1, • • ■ C,, where 






( 29 ) 
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The bit rates of L, and C, satisfy 


-logL,-[//(X)-/(X: T)Am] 
n 


-logC,-/(X: T)a(4.) 
n 


log n 

< 0(t(S), 

n 

< 0{t{6)). 


(30) 

(31) 


By discarding non-typical sequences and the fraction 3e of x!' not belonging to an (n, e) code, we obtain the approximation 

T 


I'D®" - ^ 


1 1 




Vi,(l - 3e) 


(32) 


For every (f,/) define the state \xti) - \^tic)^- By e-decodability of the channel ^cc there exists a family of 

decoding POVMs such that 


C, 


£tr[A(iArZc)£>?^] > 1-e. 


Note that tr[A(t/r,;e)D®] = tr[A((/r,;e)A(D^^^^)], and so without loss of generality, we can assume that the are diagonal in the 
incoherent basis. We consider a dilation of the (f, 1)'^ POVM. To do so, introduce the isometries Y : B ^ BB\ and lT,z : B —» BB 2 
as 


C, 


Wu = ^ ® |c> 


(33) 


Crucially, both Y and Wti represent incoherent operations. Define the state 


\X,i} = (I^^ ® W,iY) \x„} 

c, c, 


^ ^ Z Z l/)*‘ k'>*^ ■ 




(34) 


We want to show that this state is e-close to the state 

^n) = ^ Z Z ’ 

VCr 


(35) 


which would imply that Bob coherently decode |c) from \xti) without disturbing the state that much. To this end, first note that 
'iXti) (I^" ® yp^i) iTf/) where 


c, 


X,i = Yj ® ® k><c|^ 

C=1 


0(e) 

The approximation here can be seen from the fact that 


- 1 

(Xri\ (I^^ ® V^) Pf,z> = TT y tr[A((^,z,)D<''^] > 1 - e. 


Then applying Lemma 2 to tr[(D' ®Xti)x,i\ > 1 - e, we can conclude that (D' ® V^) \Xti) \Xti)- Therefore, \xii) Ixti) und 
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so 


I'i'r ""-^2 


L,(1-36) . 

i V"’ .,. 4. i 


/=1 


Z ^ Z Z (l/)"* 


Vir(l - 3e) VG 


e=l v"ej/'' 


1 


= 2 j 

/=1 

T 

- V V^k)' ;_ 

ylL,(l - 3e) 




V-L,(l - 3e) 
1 


/=! VCV 

L,(1-3£) , C, 


c=l 


(36) 


where n,/c permutes |iAr;„eo) into \>j/tic), for some fixed Zq e 1, ■ ■ ■ , L, and cq e 1, ■ ■ ■ , C,. Recall that for each type t, each li/'tic) is 
a sequence lifr,ic} = liAx,) IiAaj) •'' li/^x„) related to one another through a permutation of the |iAx,)- 

We now repeat an analogous decomposition for the CQ channel ^cQ- Since this will involve a different covering of the type 
classes we use a different labeling x" —> {tj,c). By the same arguments as above, the decomposition takes the form 


?=i 




a/^ 


(1 - 3e) U\ 


C, c=i 


(37) 


Here, like before, Wn is an isometry for the (f, 1)*^ decoding POVM of ^cq as in Eq. (33). However Wti will in general not be 
incoherent. 

Decomposition 4: The fourth decomposition is a hybrid of decompositions 1 and 2. It begins with Eq. (24) and the fact that the 
sum over m includes M,(l - e) good sets in the sense that 




X 'A..- 


(38) 


for these good values of m. In this decomposition, we now replace the RHS by ^cc channel codes. That is, we use Corollary 4 
to write 


j j j Z./ Cf 

^ Z ~ Z “ izc, Z Z 


.?=! 


P‘' x-eT" 


l=l c=\ 


Uhlmann’s Theorem again implies that for each good value of m there exists a right orthogonal matrix Vtm ' A 3 A 4 A 3 (with 
such that 


s, 




0(f) 1 


L, C, 




^^fTt c=l 

Hence by restricting to good values of m and (n, e) channel codes, we have the decomposition 

f M,(l-f) 


(39) 


I'i’r - Xi 


L,(1-3£) C, 


(40) 


Einally, similar to the construction in decomposition 2, decoding isometries Wti exist for Bob so that the state can be expressed 




1 


M,(l-f) 


r=l 


VM,(1 - e) 


X \mf 


L,{l-3e) C, 


m=l 


^JL,Q(1 - 3e) 




1=1 C =1 


for some fixed (/q, cq). 

Summary of Code Construction: 
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The four decompositions we will use are given by Eqns. (28), (36), (37), and (41). For convenience, we recall them here: 

M,(l-e) , S, 


I'D®” "t* ^ ^ 2 fj . 

^ VM,(1 - e) V57^ 




(=1 

r 


L,(l-3e) Cr 


(=1 

T 




^JL,Q(l-3e) ^ 

I'i'r ^ Z Z ^>1 (^rfe > 


(=1 

T 


yjLiCiil - 3e) z=i c=i 

Mr(l-€) 


(42a) 

(42b) 

(42c) 


^ VM,(1 - e) 


Z„(l-3e) C, 


VM,(1 - e) VDD(l-3e) ^ 




with bit rates 


-logT 

n 

-log5,-E(D 

n 

-logM,-[5(X)A(^)-E('T)] 

n 

-\ogL,-S{X\Y)t,(^) 

n 

-logC,-/(X: Daw 
n 

-5(A(iA.»))-5(T|DAm 

n 

-logZ;-[5(A)Am-E('P)] 

n 

-logD-E(D 

n 


< 0(!^), 

(43) 

< 0(T(d)), 

(44) 

logn 

< 0{t{6), -^), 
n 

(45) 

logn 

< 0{t{6), -^), 
n 

(46) 

< 0(t{6)), 

(47) 

< 0{6), 

(48) 

logn 

< 0{t{6), -^), 
n 

(49) 

< 0{t{6)). 

(50) 


PROOFS OF MAIN TEXT THEOREMS/LEMMAS AND EXPANDED DISCUSSION 

Proof of Theorem I 

Theorem 1. For a pure state the following triples are achievable coherence-entanglement formation rates 


{Ra,Rb,E^") = ( 0, ^(TIDaw , 5(A)Am) (51) 

{Ra,Rb,E^'>) = (5(A)Am, 5(T|A)Am, E(D) (52) 

{Ra,Rb,E^'>) = (0, 0, 5(AT)Am) (53) 

as well as the points obtained by interchanging A <-> T in Eqns. (51) - (53). Moreover, these points are optimal in the sense that 
any achievable rate triple must satisfy 

£™>E('P) Ra+Rb>S{XY)a(^) /?B + £“>5(AT)Am. (54) 

Example 1. Before getting to the proof, we give an example of the resource trade-off provided by Theorem 1. Consider 


the formation of = VT|+)|0) + Vl - 41-) |1) for 0 < 4 < 1/2. This state has entanglement E('T) = 5(A)>y < 1. 

The lower bounds of Theorem 1 say that any formation protocol that has E‘^° - E('T) must have coherence rates satisfying 
Rb S 5(AB)a(>p) - E)*!*) = 1. However, if Alice and Bob would rather use more eCoBits than CoBits, they can reduce Bob’s 
local coherence rate. Namely, rate (51) gives Ra -0 and Rb - S (T|A)a('P) < 1. 
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Proof. The lower bounds of Eq. (54) follow from the coherence cost rates using global operations (Theorem 3 of [7] as well as 
[ 6 ] and Theorem 5 below), and the fact that one eCoBit can be converted into one CoBit using LIOCC. Indeed, IO^b) ^ |Ob) 
when Alice performs the incoherent measurement with Kraus operators {|0)(+|, |1)(-|), and then Bob performs ctz iff Alice 
obtains outcome | 1 ). 

Moving to achievability, we first prove Eq. (51). The protocol is based on decomposition (42b). Alice and Bob share \ogE„ 
eCoBits expressed as 


T L,(l-3e) C, 




Z Z Zi''>"" 


IB)**' \ccy 


A,B2 


= 1 Z=1 c=l 


(55) 


where L,(l - 3e)C,, while Bob has an additional n[5(T|2f)A(>p) + (9(d)] CoBits. In the first step, Alice and Bob 

deterministically transform their state into 




—^— y 

Vir(l-3e) ^ 




iccf 


where q(t) is given in Eq. (42b). This can always be done via the majorization criterion of Lemma 3 below. Using his 
local CoBits, Bob first prepares Ifiigco)^ on ancillary system B, which can be done to arbitrary precision for sufficiently large 
n (Theorem 3 of [7] as well as [ 6 ] and Remark 2 below). Next he performs the incoherent unitary conditioned on 

\t)Bo ]y[ore precisely, 11,^ is performed on system B conditioned on B 0 B 1 B 2 , and is performed on BBx conditioned 

on BqBi. Einally, he decouples his registers BqBi. To accomplish this, he performs a generalized incoherent measurement 

I (f-l)(f'-l) ^ (Z-l)(/'-l) \ 

{K,j = where | 7 ,_,> = 2,^=1 ^ ^ i>(i-3e) / |/'^ Eor outcome K,j, Bob announces the 

result to Alice, and she performs the incoherent unitary 


T L,(l-3f) _ / (t-l)(Z'-l) (;-l)(/'-l) \ 

t'=l l'=l 


The desired state is thus obtained: 


T 


y v^if)^ 


1 

V-L,(l-3e) 


L,(1-3£) 

Z 


/=1 


-^Y.\cf^wl{nnc\f,kcof |c>*y 
VC, 


(56) 


Asymptotically, the consumption rates of entanglement and coherence approach Eq. (51). 

Now we prove achievability of Eq. (52). The protocol is based on decomposition (42a). Alice and Bob share logE' eCoBits 
expressed as 


T S 

^ y y , (57) 

y^n 

where £' = 5,, while Alice has an additional n[S (9f)A('f) + 0 ( t { 6 ), ^^)] CoBits and Bob has n[S (T|9f)A(ij<) + 0{5)] CoBits. 

In the first step of the protocol, Alice and Bob again deterministically transform their entanglement into 

T S 


using LIOCC. Using local CoBits, Bob prepares \ conditioned on After this, he decouples his registers B 1 B 3 using 

a measurement described above, and Alice performs a suitable incoherent unitary similar to Eq. (56). At this point, Alice and 
Bob share 


T S! 

y -7= y i<A,mos)^ ■ 

,=1 V 5 , 

Next, Alice splits her coherence into two parts |a:i) \k 2), where \Ki) - ZI-*) for (V)a('P) - £(*?) + 0{t{6), ^^)] and 

K 2 - n[E('l') + 0{t{6), ^^)]. Using |,fi), she implements a rotation |0) ^ conditioned on \t). Using \k 2 ), she 
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implements another unitary rotation |s) Utm k) conditioned on |f) |m), for s — 1, • • ■ ,5,. The desired state is thus obtained: 

T , 1 S, 




1 


?=1 


VM,(1 - e) 


2 \m)^^ £ (t/,™ \s}^^) . 

i=i 


m=l 


Asymptotically, the consumption rates of entanglement and coherence approach Eq. (52). 

Finally, the achievability of Eq. (53) follows from Eq. (51) and the fact that every eCoBit can be deterministically transformed 
into a CoBit for Bob using LIOCC. 

□ 


Proof of Lemmas 2 and 3 


Lemma 2. An arbitrary dxd unitary operator U can be performed on a system using incoherent operations and [log d\ CoBits. 
Proof. Let us introduce an orthonormal basis for a bipartite system S S' consisting of maximally entangled states 

(58) 

where 

j d-l d-l 

^ ® " Z ^''1^ + 

(=0 1=0 

with T = g 2 m/d addition is modulo d. The {\bjk}} generalize the Bell basis in higher dimensions. Note that each unitary Wjk 
is an incoherent operation. 

Suppose now that we wish to perform an arbitrary d ® d unitary U on some state \if}. We can accomplish this incoherently 
using [logt/] CoBits through the following procedure. First, let |i/f) belong to system S, and express = [ij/] where 

[i/r] is a X complex matrix and - Ij ^[d Yfifo 1*)'^ ■ We next introduce [log d\ CoBits on system S'. This is 

deterministically transformed into = 1/ 2f=o 1*)^ ’ which can always be accomplished since majorizes 
(see Theorem 1 of [7] as well as Ref. [20]). An additional system S " is then introduced in state |0)'^ and an entangling incoherent 
operation is performed to obtain 

Thus at this point the state across all three systems 

l<A)^ !<?,„) = - ^ {W |i>") ® l;>"' ® \jY" ■ (59) 

^ /j=0 

An incoherent measurement is then performed on systems SS' given by 

Mjk = \jk} {{bjk\ ® f') = |I ® U^W^. (60) 

From Eq. (59), we see that outcome jk generates the (unnormalized) post-measurement state 


1 

- ^ ® \nf' 

m,n=0 

^ d-l 

" ^ Z ®\nf' 

m,n=0 

1 1 

= - \jk) ® ^ ^ \mf" 

^^\jk)®W*,U\iJrf' . (61) 

Therefore, after applying the (incoherent) rotation W*^. on system 5", the state U It//} is obtained. □ 
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Remark 1. Lemma 2 can easily be extended to performing controlled unitaries of the form |r)(r|‘' ® Ur, where C is the 
control system of dimension R. When each Ur acts on a t/-dimensional system, the amount of CoBits needed to perform this 
operation is given by flog d\. Indeed, the above protocol is repeated with a replacement in Eq. (60) of an incoherent measurement 
performed on systems CSS' given by 


R 

Mjk = 2 \r){rf ® \jk) [{bjk\ Uf ® f') . (62) 

r=l 

Remark 2, Lemma 2 offers an alternative protocol for coherence dilution in pure states [7]. For an arbitrary pure state = 
Zli \x), we can transform |+)" at any rate R < 5(X)a(i/,) as « —> cm. To see this we consider the n-copy 

decomposition of \^) into its typical and atypical parts [7]: 

l-A)®" = Itypical) + latypical). (63) 

Since and —> 1 for arbitrary 5 > 0 and as n ^ oo, dilution is achieved by performing a unitary 

operation that rotates |+)" to |typical). Since |typical) is an element in a J-dimensional space. Lemma 2 implies that such a 

unitary can be implemented by incoherent operations at a coherence consumption rate arbitrary close to S (2f)A(i/,)- 

Lemma 3. Let and be two bipartite pure states with squared Schmidt coefficients being T(i/f) and f((p) respectively. 
Suppose that Alice and Bob’s incoherent bases are Schmidt bases for both and and suppose that '?(0) majorizes t(^) 
(i.e. < f{(p)). Then there exists an LIOCC protocol transforming with probability one. 

Proof. Recall that a probability distribution y = (y i, ■ ■ ■ , y^) majorizes another distribution x - {xi,-- ■ , Xn) if y^ ^ 
for all A: = 1, • • • ,«, where yj is the components of y in non-increasing order and likewise for .rj. Without loss of generality, 
suppose that both and are maximally correlated (i.e. have the form |i/f) = 2,- \ii) and \(p) = 2,- Pad f{ij/) 

with enough zeros so that f{ij/) and f{(p) are real vectors of equal length. Since f{i//) < T{(f>), there exists a doubly stochastic 
matrix D such that fiijj) - Df((f>) [58]. Birkhoff’s Theorem assures that D - YiaPa^a, where the form a probability 
distribution and the IIq, are permutation matrices. Then define the operators Mq, := Vpffll^ • S, where the elements of S 
are given by [[5]],y = V^/ and denotes the Hadamard product. Recall that the Hadamard product of two matrices 
A and B is the matrix A • B with elements [[A • B]],j = [[A]], 7 [[B]],j. Note that each Ma is an incoherent operator. By 
construction Me ® Ho. \i//} cc |(^) for every a, and the relation = 2^, paniyf(f) readily implies that 2 q. M^Ma - I. Hence, the 
protocol consists of Alice performing the incoherent measurement {Ma]a, announcing her result, and then Bob performing the 
permutation Hq,. □ 


Proof of Theorem 4 


Theorem 4. The function is an LIOCC monotone. 

Proof. By the convex roof construction, it suffices to prove monotonicity for pure state transformations [34]. To do so, we first 
introduce two relative entropy quantities for a general density matrix on system S and a bipartite state on joint system 
AB: Crip^) = mino-s£j5(p‘^||cr^) [3] and = mino-Ae^Qj 5(p'^®||cE'*) [33], where I is the set of incoherent states for 

system S and QI is the set of quantum-incoherent states for system AB. For a pure state with reduced density matrices ifA 
and tp^, these quantities reduce to Cr(tf^) — S (A)a((o) - E(^), Cr(tp^) - S (B)a((^) - E((,o), and - S (B)a((^)- Furthermore, 

it was shown in Ref. [33] that Cf for pure states, where '*(p'^*) the optimal asymptotic rate of coherence distillation 
on Bob’s side when Alice helps. Collecting these observations we therefore obtain 

Cx(/^) = Cf'^(/^) + C,(/) - Cf'^(/^) + Cr(ip\ (64) 

Now suppose that in the first round of the protocol, Alice makes a local measurement on the joint state that gen¬ 
erates an ensemble of pure state transformations {\cok}^^, Pk}- Since is an LIOCC monotone, we have 

C^I*('I"^^) ^ Yik PkCf^^iatf^), and likewise because Cr (for Alice’s system) is a monotone under Alice’s incoherent opera¬ 
tion, we have CrCi"^) > Yik PkCricof). Hence Cx('I''^*) > Tik PkC_c{<xf^^). When Bob measures in the next round, we repeat the 
same argument on each and use the fact that Cx(w^^) = Cf''^(w^^) + Cr{a>f). By iteration, Cx behaves monotonically for 
all rounds of the protocol, and the theorem is proven. □ 
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Proof of Theorem 5 

Theorem 5. For a pure state the following triples are achievable coherence-entanglement distillation rates 

(RA,RB,En = (5(X)Am - Em, 5(F)Am, 0) (65) 

{RA,RB,En = (0, 5(y|X)A(^), I(X : ( 66 ) 

as well as the points obtained by interchanging A <-> B in Eqns. (65) and ( 66 ). Moreover, these points are optimal in the sense 
that any achievable rate triple must satisfy 


Ra+Rb< = 5 (X)a(,,) + 5(F)a(>p) - E(T), R^ + < 5(7)^^. (67) 


Proof. The upper bound Ra+Rb 5 follows from monotonicity of Cx under LIOCC and the fact that Cx is asymptotically 

continuous. The latter property holds because Cx is defined on pure states in terms of relative entropy measures and extended to 
mixed states using a convex roof (see [59]). The upper bound Rb + £“ < S (7)a(>p) follows from the first and again the fact that 
one eCoBit can be transformed into one local CoBit using LIOCC. 

Moving to achievability, we first prove the rate triple given in Eq. (65). The protocol is based on decomposition (42d). Starting 
from I'P)®" expressed in this form, Alice first measures the typical type encoded in register Ai and (with high probability) 
announces the result f. This leaves them with the state 




VM,(1 - e) 


^ \m) 

m=\ 


A2 




r.,(l-3e) C, 


Alice next the performs the incoherent measurement on A 3 A 4 where 


K,c = mHycWl 


( 68 ) 


/ (c^l)(c'-l) \ 

where \yc) - 2^=1 e ' / |c'). She announces the result (I, c), and then Bob performs the incoherent operation ® 

/ (c-l)(c'-l) \ 

I*^) to systems BB 2 followed by the error-correction unitary Uc - e ' z |c') performed on 52 - The output state is 


1 

VM,(1 - e) 


Z 


m=\ 


C, 

|m)^^ ^ |c)^M<ArZ„co>^ 

C=1 


(69) 


Bob can further distill \4iti0cf) ^ 1+)™ with r —> 5(7|7f)A(>r) as n ^ 00. The total rates of coherence distillation thus approach 
Eq. (65). 

We next turn to the achievability of rate triple Eq. ( 66 ). It is based on decomposition (42b). Alice first does a type measurement 
and with high probability will generate the post-measurement state 


1 


z 


/=1 


Cr 

C=1 


Alice then measures the code block I on register A 2 and communicates the result to Bob. He then performs the incoherent unitary 
W,i with the permutation H,^^ conditioned on |c)^T This generates the state 


1 


Z 




(70) 


which asymptotically approaches the desired rates of Eq. ( 66 ). 


□ 


Remark 3. As noted in the main text, it is still unknown the optimal rate in which eCoBits can be distilled from a pure state 
using LIOCC. Rate triple ( 66 ) gives a rate of I{X : T)a(>p) with an additional coherence output rate of S (T|2f)A(>p). However, this 
point is not optimal in terms of the eCoBit rate. The reason is that the quantity I(X : T)Acr) can be increased by LIOCC. As an 
example of this effect, consider the state 

10 ) ® (| 0 > + ID + | 2 » + ID ® (10) - ID + |2))). 
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It can be seen that I(X : F)a('j') = 0. However, when Bob performs the incoherent measurement described by Kraus operators 
[Kq = |0)(+| + |1)(2|, Ki - |0)(-|), correlations are generated by measuring in the incoherent bases after Bob obtains outcome 
Kq. Hence, this state has a nonzero eCoBit distillation rate. 

We will now show that optimizing the mutual information I{X : F)A(ir) over all LIOCC protocols yields the maximum eCoBit 
distillation rate 

Lemma. For a pure state the optimal distillation rate of IOab) is given by 



(71) 


m 


where the supremum is taken over all LIOCC protocols that generate the multi-outcome transformation |'l')®" ^ {p{in), |'l'm))^j^j. 

Proof First let us prove sufficiency. Consider any LIOCC protocol £. that generates the pure state transformation 'F®" ^ 
Fix arbitrary e,6 > 0. We consider t blocks of 'F®" and perform £. on each of the blocks. This is a standard 
technique used in quantum Shannon theory, and is often called “double blocking”. For t sufficiently large, with probability > 1 -e 


the state obtained is with \Nm - p(m)\ < 6. This follows from the definition of d-typicality and Eq. (12). On each 

'F^/^'" Alice and Bob perform the distillation protocol of Theorem 5, thus generating the state <Tm where ||cr„, - ^ 




and Rm = I(X : F)a('p,„). Hence in total we have the transformation T*®"' ^ where || cr^ - q)®L»h^'"(^"' 

from which we compute the rate 



(72) 


m m 


where the additional terms come from Nm > p(m) - 6 and the removal of [-J. 

We now turn to the converse. Consider any LIOCC distillation protocol transforming T*®" —> 'ZimPi'^)Pm such that 
F(Yj,„ p{m)pm, *5®^^) > 1 - e (where need not be pure). Hence, 


{I - ef <Y,P(’n)F(p,„,<:>fs? 


m 


< Y^p{m)F(Mp,„),A{(^%)f. 


(73) 


m 


Using Fannes’ Inequality (Lemma 1), monotonicity of the trace norm under CPTP maps, and the relation F(p, erf < 1 - ^||p - 
cr||j, it is straightforward to show that 



where we have also used the fact that I(X : F)a($^®) = ^R. Combining with Eq. (73) gives 



Therefore, we obtain 


i 2 p(m)I{X : F)a(p„.) >R-3 logd^dBrni - (1 - - l/«. 


m 


This completes the proof 


□ 


Proof of Theorem 6 


Theorem 6. 


rLWCC^ 


('F) = ECT-). 


(74) 
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Proof. The protocol is based on decomposition (42c). Quite simply, Alice measures the typical type \t)^' and codebook |/) 
With high probability the post-measurement state will take the form 





C, c=i 




(75) 


This is a maximally entangled state of approaching the desired size of Ct —> E('T) as n ^ oo. 


□ 


Proof of Theorem 7 

Theorem 7. A mixed state has distillable entanglement if and only if entanglement can be distilled using LIOCC. 

Proof. Note that an arbitrary quantum operation can be accomplished by unitary operations and incoherent projective measure¬ 
ments. Thus, if X is a general LOCC operation such that X(p®") ~ then, because of Lemma 2, there exists an LIOCC 
operation X/ consuming some finite amount of local coherence that transforms X/(p®") = X(p®") ~ Therefore, to asymp¬ 
totically distill entanglement from p by LIOCC, it suffices for Alice and Bob to first have a sufficient amount of local coherence. 
Theorem 2 of Ref. [33] implies that local coherence for either Alice or Bob can be distilled from using LIOCC whenever 
p^^ is entangled (see Remark below). Hence, Alice and Bob first use ha copies of p^^ to distill a sufficient amount of local 
coherence for Alice and an additional vib copies to distill sufficient coherence for Bob. They can then implement X/ on p®" with 
high precision, thus generating a close of approximation of <[), 4 b using LIOCC operations and + rig + n copies of p. □ 

Remark 4. Ref. [33] deals with a more general setting in which the assisting party can perform arbitrary quantum operations. 
However, the projective POVM described in Theorem 2 of [33] can be implemented incoherently. Indeed, if, say Alice, performs 
any projective measurement {\bk){bk\]'lzl with the \bk) being orthonormal, Bob’s post-measurement state will be the same if Alice 
were to instead perform the incoherent projective measurement {\k){bk\YlZQ. See also Ref. [38]. 



